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, , A unified, Bayesian inference of midplane electron temperature and density profiles 

using both Thomson scattering (TS) and interferometric data is presented. Beyond 
the Bayesian nature of the analysis, novel features of the inference are the use of a 
Gaussian process prior to infer a mollification length-scale of inferred profiles and 
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the use of Gauss-Laguerre quadratures to directly calculate the depolarisation term 
associated with the TS forward model. Results are presented from an application 
of the method to data from the high resolution TS system on the Mega-Ampere 
Spherical Tokamak, along with a comparison to profiles coming from the standard 



CN analysis carried out on that system. 

> 

00 

o 

m 



a J greg.vonnessi@anu.edu.au 



1 



I. INTRODUCTION 



The incoherent Thomson scattering (TS) of laser light off of electrons is the foundation 
for standard diagnostics used to measure electron temperature temperature and density 
in fusion devices.^ Recently, the MAST Thomson scattering (TS) system was upgraded to 
measuring 130 points across the midplane at a frequency of up to 240Hz. 2 



In this work, Bayesian methods are used to infer both T e and n e profiles using both TS 
and interferometric data coming from diagnostics on the Mega- Ampere Spherical Tokamak 
(MAST). The primary advantage of this analysis is that an absolute calibration of the 
time-integrated TS laser pulse energy is not required to infer either the thermal electron 
temperature nor density profiles. 



A stochastic model, based on Gaussian processes (GPs)^, is used to model the correla- 
tion between TS observation points. The noise, signal-variance and length-scale associated 
with this model are inferred as nuisance parameters in addition to the T e and n e profiles. 
This provides a strong benefit of the presented approach, in that signal noise is strongly 
decoupled from the inferred profiles; and thus, helps to mitigate the probability of previous 
errors^HSl, where signal noise was given a physical interpretation, from reoccurring. Moreover, 
the standard perturbative methods for approximating the depolarisation term in the TS for- 
ward model, (c.f. Naito, et. al.^j) are bypassed in this analysis, in favour of a more direct 
and flexible numerical integration calculation using Gauss-Laguerre quadratures. Caching 
of depolarisation calculations is employed to negate the higher computational cost of this 
approach. 



The paper is structured as follows. Section ?JTT] gives a brief overview of Bayesian inference 
using diagnostic data and the application of GPs to model the correlation between obser- 
vation points. Next, the TS and interferometric forward models used are presented with 



a description of the depolarisation computation. Section |IV| present profiles inferred for a 
high-performance MAST discharge along with a comparison to profiles calculated from the 
MAST standard analysis. Finally, conclusions and possible extensions are discussed. 



II. OVERVIEW OF BAYESIAN INFERENCE 



The objective of any inference scheme is to statistically infer a vector of model param- 
eters, denoted A, given a vector of diagnostic data and associated uncertainties, ~\L and a 
respectively. In the Bayesian perspective, inference centres around Bayes' formula: 



_ _ n,p(ft|A,<T,,/) p(a) 
p (W) = ^ — v^jy — • (1) 

where I denotes background assumptions. To keep notation uncluttered, / is dropped for the 
rest of the paper, with background assumptions being explicitly indicated where appropriate. 
The parts of Eq. ([I]) and their application/interpretation in the context of diagnostic data 
are well documented in the literature®^ and will not be discussed in detail here. 

TS and interferometer uncertainties are assumed to be pairwise uncorrelated, with each 
observation having an associated likelihood of the assumed form 

V( f i l \\,a)=Af^-F(X),cr 2 ), (2) 

where Af(ji, cr 2 ) is a Gaussian distribution of mean \i and a 2 the variance; J-'(X) is the forward 
model associated with the given diagnostic. Justifications for this form of the likelihood are 
given elsewher e! 8 * 10 ! 

A GP is employed to define the covariance matrix reflecting a spatial correlation between 
TS observation points: 

K i]k := C, 2 exp (~ ( \ T f i)2 ) + Vita, (3) 

where Ri and Rj represent the radial coordinate of the ith and jth TS observation point 
along the midplane; and the k index indicates correspondence to electron temperature or 
density measurements, with a value of or 1 respectively. In Eq. (|3j) Cfe? r fe and % are com- 
monly referred to as hyper-parameters^ non-physical quantities which help characterise the 
prior and/or likelihood in the overall inference. In this analysis, these are scalar quantities 
uniquely associated with each profile: reflects the average radial length-scale over which 
the profile is changing; is the signal variance, which serves to decouple the average profile 
gradient from the length-scale; and % is the average scalar noise on the profile. With the 
expression in Eq. ([3]), the prior for the inference is proportional to 

V(X) = V(f e , rf e , r , n, Co, Ci, Vq, rji) 



ex J\f (f e K^fA M (n e T K^n c ) \{ (l [0j io](Tl)l[o,io](d)l[o,io](ffc)) , (4) 

where M are zero-mean Gaussian distribution with their covariance matrix shown in the 
argument, and l\ a u(x) indicates a uniform distribution of the variable x over the closed 
interval [a, b]. The upper bounds for the Tfc, and rjf, uniform distributions were empirically 
selected so as to not preclude any physically attainable profiles on MAST, as determined by 
analysing 36 different shot/time slices. 

Equation Q shows the prior as a product of uniform and zero-mean Gaussian distribu- 
tions. As the covariance matrices for the Gaussians over T e and n e are themselves charac- 
terised by inferred hyper-parameters, the prior in Eq. Q embodies a stochastic model of 
the spatial correlation between different TS observation points. For a fixed set of hyper- 
parameters, one can think of this prior as favouring T e and n e profiles convolved with Gaus- 
sian kernels (i.e. mollifications^) of fixed widths corresponding to in the inference. As 
the hyper-parameters themselves are inferred with minimal constraint, the analysis is also 
able to infer the average length-scale, signal variance and noise variance for both the T e and 
n e profiles, as intrinsically held by the data. Further details on these points can be found in 
RasmusserP. 



III. FORWARD MODEL 



In the MAST TS system, Thomson scattered light from each observation point is spec- 
trally divided into four bands via a polychrometer. Each filtered band is then focused onto 
an avalanche photodiode (APD), translating the integrated intensity over the spectral band 
into a voltage signal. When integrated over the TS laser pulse length, a quantity is produced 
which is sensitive to both the thermal electron temperature and density at the associated 
observation point: 

V TS = C s n e E L J — dX, (5) 

where Cs encompasses a collection of known, fixed system constants; El is the integrated 
laser energy; 0(A) is the polychrometer response functions; Al is the TS laser wavelength; \ s 
is the wavelength of the scattered photons; 9 is the scattering angle; and S(X S , Xl, 0,T e ) the 
standard Selden expression. 121 ^ 1 The Selden relation relates the intensity of scattered light 
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off of thermal electrons to T e and is standard in the literature; but the relation is recalled 
here for completeness (note the variable transformations in Eqs. (p|-[IO|)): 



S(e,0,2a) = S z (e,6,2a)q(e,6,2a), (6) 



exp(— 2ax) 



-1/2 



A s — Aj m e c 2 sin 9 

e:= — » , 2a := ^— , u=- -, 9 

Xi T e 1 — cos 

( e 2 \ 1/2 1 

X[= l v 1 + 2(l-cos0)(l + e)J ' y:= (x 2 + M 2 )V2' (10) 

where q(e, 6, 2a) represents the relativistic depolarisation correction termP^d! 

For TS systems, the expressions presented by Naito, et. al.^are normally used to approx- 
imate q(e, 9, 2a). In this work, however, Gauss-Laguerre quadratures are utilised to provide 
a more direct and flexible calculation of the depolarisation correction. The fundamental 
difference in this approach is that it is a non-perturbative calculation, which contrasts the 
approximations of Naito, et. alP that are based Taylor expansions. Indeed, changing vari- 
ables according to v = 2a — x) in Eq. (l8j) gives 



q{e,6,2a) = 1 + 



y 



( \ 

exp(-u) f°° exp(-f) 

{ Jo (fe^) 2 +« 2 ) ; ° ((i,+*y^) ' J 



(ii) 

which can be integrated directly using a Gauss-Laguerre quadrature^!. The initial computa- 



tional cost of the quadrature construction is offset by the fact that the integrals in Eq. (11) 



are computed via pre-calculated quadrature poles and weights. Gauss-Laguerre quadratures 



are very accurate/efficient for calculating integrals of the form seen in Eq. (11) and allow for 
arbitrary levels of accuracy to be specified by simply changing the number of quadrature 
points.^ 

The primary issue with a forward model based on Eq. (|5| is that it requires an absolute 
calibration of El, in addition to the calibrations reflected in the value of Cs- Fortunately, 
MAST has a midplane CO2 interferometer that provides a line-integrated measurement of 
n e in the midplane. Using this data, a simple coordinate transformation is employed to con- 
struct the integrated electron density along the TS laser's line of sight (also in the midplane). 



As El is the same for all TS observation points, it can be inferred as a nuisance parameter 
(i.e. an unphysical parameter which is integrated out in the inference of physical model 
parameters) that can also absorb any error in C$, with n e still being well-constrained by 
both TS and the line-integrated interferometeric observation. This serves to greatly reduce 
the errors seen on the inferred profiles, even when El is given a physically unconstraining 



uniform prior, as in this analysis (see {TV). 



IV. RESULTS 

To demonstrate the analysis detailed above, an inference and subsequent comparison is 
made against profiles coming from the standard TS analysis carried out on MAST. This 
comparison is made on discharge 24600 at 280ms, which is a L-mode discharge in a DnD 
configuration with 3.35MW of co-injected NBI heating. The Bayesian inference was carried 
out using a 1000 point Gauss-Laguerre quadrature to calculate q(e, 9, 2a), with posterior mo- 
ments taken from sampling statistics obtained via a specialised implementation of Skilling's 
nested sampling (NS) algorithm detailed elsewhere^ (see Sivia and Skilling^ for details 
on NS). The sampling results were also independently validated by comparison with sam- 
ples generated from a Hamiltonian Markov Chain Monte Carlo (HMCMC) algorithm (see 
MacKajJ^for details on HMCMC). 

Figure [T] shows that the inferred, expectation of the profiles show generally, very good 
agreement with the profiles coming from standard analysis. The most striking difference 
between both sets of profiles is the difference in uncertainties. Indeed, the nature of the 
unified Bayesian inference indicates that the profiles are very well constrained. This can be 
understood by noting that T e and n e are being inferred from four APD measurements (nom- 
inally) at every observation point, which are in addition to the global constraints provided 
by interferometry and the prior, rendering the inference as being strongly over-determined. 
The presence of global constraints and over-constraining local observations make the infer- 
ence very robust against APD errors, mis-calibrations or signal loss. In contrast, the MAST 
analysis uncertainty reflects a maximum entropy result (i.e. with minimal prior assumptions 
relative to the forward model used) and is expected to have higher uncertainties than results 
utilising the prior in Eq. Q (or any analogy thereof). 

As mentioned in £j|IJ this inference yields mollified profiles, along with the associated 
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Electron Temperature vs. Midplane R (a) Electron Density vs. Midplane R (b) 
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FIG. 1. Comparison of inferred T e and n e profiles inferred from TS data for shot #24600 at 280ms, 
with T e and n e profiles corresponding to (a) and (b) respectively. The heavy lines indicate profiles 
inferred using the Bayesian analysis described in ^TIJ The dashed lines marked with diamonds 
and coloured region reflect MAST scheduler output for the corresponding profiles and uncertainty 
respectively. Uncertainties associated with the expectation of the Bayesian inferred profiles are too 
small to resolve visually and have been suppressed from both figures. 

hyper-parameters represented in Eq. (|3]). Thus, the results produced have a different inter- 
pretation (i.e. they need to be understood in relation to their respective hyper-parameters) 
than the profiles from the standard MAST analysis. Indeed, Fig.[l] can only be viewed as 
a sanity check for the global shape/scale of the T e /n e profiles, as the fine-scale structure 
information coming from the Bayesian analysis is largely contained in the inferred hyper- 
parameters. In this context, it is not surprising to see the inferred profiles lying outside 
the errors bars associated with the MAST analysis. One of the main advantages for such 
a comparison, is that one can qualitatively see how much of the profile structure can be 
attributed to a scalar noise term. Quantitatively, one can compare the scalar noise term, 
rjk, of the profile to the random errors modelled for the diagnostic to validate uncertainties 
produced by explicitly propagating error through the forward model itself. Finally, one may 
adjust the prior on any of the hyper-parameters in the inference to reflect a priori knowledge 
of the length-scales or random noise seen in the system; this can even be taken to the point 
where these hyper-parameters are even fixed at particular values. Of course, such priors will 
effect the structure of the inferred profiles; e.g. fixing a relatively small length-scale and 
scalar noise will yield profiles which have a structure closer to those produced by the MAST 
analysis. Exploration of physics using these more informed priors on the hyper-parameters 
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is a current research endeavour. 



Nuisance Parameter (Unit) 


Sampling Expectation w/ 95% confidence interval 


El (J) 
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Co 
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(1.345 x 10-% L SS£ 
(2.296 x 10-i)UJ»S^ 
1 nnfi+ 2 - 822x10 " 3 

l.UUU c in 7v in-3 

— 0.4z(xlU 
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(9 974 x in-2^+i.76lxlo-3 

{J.VI'i X 1U J_ L081xl0 -2 

i 1+3.039X10" 3 
i ' Oii -2.839xl0-3 

(6.650 x 10-3)li353xio-3 



TABLE I. Table of sampling statistics of nuisance parameters used in the inference of T e an n e 
profiles from TS data for discharge #24600 at 280ms. 



Table [I] shows sampling expectations and uncertainties associated with the nuisance pa- 
rameters presented in Eq. ^ and Eq. While the values in Tab. [j] are treated as nuisance 
parameters, they have been included to give the reader some context for the length scales, 
signal and noise variance that are typically encountered with MAST TS data. Again, the 
uncertainties on these values are small relative to the inferred values, as the inference is 
over-constrained to the point where uncertainties due to degeneracies in highly likely model 
parameter configurations are all but eliminated. 

Finally, as the inference outlined in this paper is non-analytic in nature, having approxi- 
mately 270 model parameters for a given discharge, it is slower than the standard analysis. 
Indeed, to calculate statistical moments of the profiles using NS takes approximately thirty 
minutes per inference (on average), on a 2.2GHz processor with 8GB of memory. It is a cur- 
rent research focus to speed up this inference by developing a parallelised version of sampling 
algorithm. 

V. CONCLUSIONS 

A new method for the unified Bayesian inference of thermal electron temperature and 
density profiles has been demonstrated, which is also able to infer length-scales and scalar 
noise parameters intrinsically contained within the diagnostic data. By employing GPs, the 
average length-scale, signal variance and noise variance are inferred as nuisance parameters, 
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which directly yield profiles where noise is minimised. Finally, a new approach to calculating 
the depolarisation correction in the TS forward model is presented, which is simple to 
implement and affords easy adjustment of calculation accuracy. 
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